94 research outputs found
Genome-wide association analyses of North American Rheumatoid Arthritis Consortium and Framingham Heart Study data utilizing genome-wide linkage results
The power of genome-wide association studies can be improved by incorporating information from previous study findings, for example, results of genome-wide linkage analyses. Weighted false-discovery rate (FDR) control can incorporate genome-wide linkage scan results into the analysis of genome-wide association data by assigning single-nucleotide polymorphism (SNP) specific weights. Stratified FDR control can also be applied by stratifying the SNPs into high and low linkage strata. We applied these two FDR control methods to the data of North American Rheumatoid Arthritis Consortium (NARAC) study and the Framingham Heart Study (FHS), combining both association and linkage analysis results. For the NARAC study, we used linkage results from a previous genome scan of rheumatoid arthritis (RA) phenotype. For the FHS study, we obtained genome-wide linkage scores from the same 550 k SNP data used for the association analyses of three lipids phenotypes (HDL, LDL, TG). We confirmed some genes previously reported for association with RA and lipid phenotypes. Stratified and weighted FDR methods appear to give improved ranks to some of the replicated SNPs for the RA data, suggesting linkage scan results could provide useful information to improve genome-wide association studies
Region-based analysis in genome-wide association study of Framingham Heart Study blood lipid phenotypes
Due to the high-dimensionality of single-nucleotide polymorphism (SNP) data, region-based methods are an attractive approach to the identification of genetic variation associated with a certain phenotype. A common approach to defining regions is to identify the most significant SNPs from a single-SNP association analysis, and then use a gene database to obtain a list of genes proximal to the identified SNPs. Alternatively, regions may be defined statistically, via a scan statistic. After categorizing SNPs as significant or not (based on the single-SNP association p-values), a scan statistic is useful to identify regions that contain more significant SNPs than expected by chance. Important features of this method are that regions are defined statistically, so that there is no dependence on a gene database, and both gene and inter-gene regions can be detected. In the analysis of blood-lipid phenotypes from the Framingham Heart Study (FHS), we compared statistically defined regions with those formed from the top single SNP tests. Although we missed a number of single SNPs, we also identified many additional regions not found as SNP-database regions and avoided issues related to region definition. In addition, analyses of candidate genes for high-density lipoprotein, low-density lipoprotein, and triglyceride levels suggested that associations detected with region-based statistics are also found using the scan statistic approach
A case report and genetic characterization of a massive acinic cell carcinoma of the parotid with delayed distant metastases.
We describe the presentation, management, and clinical outcome of a massive acinic cell carcinoma of the parotid gland. The primary tumor and blood underwent exome sequencing which revealed deletions in CDKN2A as well as PPP1R13B, which induces p53. A damaging nonsynonymous mutation was noted in EP300, a histone acetylase which plays a role in cellular proliferation. This study provides the first insights into the genetic underpinnings of this cancer. Future large-scale efforts will be necessary to define the mutational landscape of salivary gland malignancies to identify therapeutic targets and biomarkers of treatment failure
Pathologic gene network rewiring implicates PPP1R3A as a central regulator in pressure overload heart failure
Heart failure is a leading cause of mortality, yet our understanding of the genetic interactions underlying this disease remains incomplete. Here, we harvest 1352 healthy and failing human hearts directly from transplant center operating rooms, and obtain genome-wide genotyping and gene expression measurements for a subset of 313. We build failing and non-failing cardiac regulatory gene networks, revealing important regulators and cardiac expression quantitative trait loci (eQTLs). PPP1R3A emerges as a regulator whose network connectivity changes significantly between health and disease. RNA sequencing after PPP1R3A knockdown validates network-based predictions, and highlights metabolic pathway regulation associated with increased cardiomyocyte size and perturbed respiratory metabolism. Mice lacking PPP1R3A are protected against pressure-overload heart failure. We present a global gene interaction map of the human heart failure transition, identify previously unreported cardiac eQTLs, and demonstrate the discovery potential of disease-specific networks through the description of PPP1R3A as a central regulator in heart failure
BR-squared: a practical solution to the winner’s curse in genome-wide scans
The detrimental effects of the winner’s curse, including overestimation of the genetic effects of associated variants and underestimation of sufficient sample sizes for replication studies are well-recognized in genome-wide association studies (GWAS). These effects can be expected to worsen as the field moves from GWAS into whole genome sequencing. To date, few studies have reported statistical adjustments to the naive estimates, due to the lack of suitable statistical methods and computational tools. We have developed an efficient genome-wide non-parametric method that explicitly accounts for the threshold, ranking, and allele frequency effects in whole genome scans. Here, we implement the method to provide bias-reduced estimates via bootstrap re-sampling (BR-squared) for association studies of both disease status and quantitative traits, and we report the results of applying BR-squared to GWAS of psoriasis and HbA1c. We observed over 50% reduction in the genetic effect size estimation for many associated SNPs. This translates into a greater than fourfold increase in sample size requirements for successful replication studies, which in part explains some of the apparent failures in replicating the original signals. Our analysis suggests that adjusting for the winner’s curse is critical for interpreting findings from whole genome scans and planning replication and meta-GWAS studies, as well as in attempts to translate findings into the clinical setting
Recommended from our members
<i>De Novo</i> and Rare Variants at Multiple Loci Support the Oligogenic Origins of Atrioventricular Septal Heart Defects
Congenital heart disease (CHD) has a complex genetic etiology, and recent studies suggest that high penetrance de novo mutations may account for only a small fraction of disease. In a multi-institutional cohort surveyed by exome sequencing, combining analysis of 987 individuals (discovery cohort of 59 affected trios and 59 control trios, and a replication cohort of 100 affected singletons and 533 unaffected singletons) we observe variation at novel and known loci related to a specific cardiac malformation the atrioventricular septal defect (AVSD). In a primary analysis, by combining developmental coexpression networks with inheritance modeling, we identify a de novo mutation in the DNA binding domain of NR1D2 (p.R175W). We show that p.R175W changes the transcriptional activity of Nr1d2 using an in vitro transactivation model in HUVEC cells. Finally, we demonstrate previously unrecognized cardiovascular malformations in the Nr1d2tm1-Dgen knockout mouse. In secondary analyses we map genetic variation to protein-interaction networks suggesting a role for two collagen genes in AVSD, which we corroborate by burden testing in a second replication cohort of 100 AVSDs and 533 controls (p = 8.37e-08). Finally, we apply a rare-disease inheritance model to identify variation in genes previously associated with CHD (ZFPM2, NSD1, NOTCH1, VCAN, and MYH6), cardiac malformations in mouse models (ADAM17, CHRD, IFT140, PTPRJ, RYR1 and ATE1), and hypomorphic alleles of genes causing syndromic CHD (EHMT1, SRCAP, BBS2, NOTCH2, and KMT2D) in 14 of 59 trios, greatly exceeding variation in control trios without CHD (p = 9.60e-06). In total, 32% of trios carried at least one putatively disease-associated variant across 19 loci,suggesting that inherited and de novo variation across a heterogeneous group of loci may contribute to disease risk
- …